PITCH ESTIMATION FRAMEWORK FOR SPEECH SEGREGATION USING COCHLEAGRAM MORPHING
Abstract
Computational auditory scene analysis (CASA) has a significant role in speech segregation from monaural audio mixtures and is generally a measure for the performance of recognition systems. Pitch estimation is a substantial part of CASA. This study presents a novel pitch estimation framework using cochleagram morphing. The proposed framework takes the rough pitch of the target from a given mixture containing background interferences. A discrete set consisting of morphed versions is obtained using k-Means clustering. The estimated pitch values are improved by validating and smoothing them according to the cochleagram. A measure of the refined pitch contours, along with harmonicity and temporal continuity, is used to segregate speech. The framework produced 83.13% accuracy on the MIR-1k dataset, which is considerably higher than existing methods.
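As a rough illustration of the clustering and smoothing steps the abstract mentions, the sketch below quantises a noisy frame-wise pitch track to a small discrete set with k-means and then median-filters the result. It is not the authors' implementation: the sample pitch values, the choice of two clusters, the window width, and all function names (`kmeans_1d`, `quantise`, `median_smooth`) are assumptions made for the example, and the cochleagram-based validation is not modelled.

```python
from statistics import median


def kmeans_1d(values, k, iters=50):
    """Plain Lloyd's k-means on scalar values; returns sorted cluster centres."""
    lo, hi = min(values), max(values)
    # Deterministic start: centres spread evenly across the observed range.
    centres = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centres[j]))
            groups[nearest].append(v)
        # An empty cluster keeps its previous centre.
        updated = [sum(g) / len(g) if g else c for g, c in zip(groups, centres)]
        if updated == centres:
            break
        centres = updated
    return sorted(centres)


def quantise(values, centres):
    """Map each value to its nearest cluster centre."""
    return [min(centres, key=lambda c: abs(v - c)) for v in values]


def median_smooth(values, width=3):
    """Median filter with edge replication; width must be odd."""
    assert width % 2 == 1, "width must be odd"
    half = width // 2
    padded = [values[0]] * half + list(values) + [values[-1]] * half
    return [median(padded[i:i + width]) for i in range(len(values))]


# Hypothetical rough pitch track in Hz, one value per frame.
# Frames 3 (240.0) and 9 (60.0) mimic octave errors caused by interference.
rough_pitch_hz = [118.0, 121.0, 119.5, 240.0, 120.5, 122.0,
                  181.0, 179.0, 180.5, 60.0, 182.0]

centres = kmeans_1d(rough_pitch_hz, k=2)      # discrete set of pitch values
discrete = quantise(rough_pitch_hz, centres)  # frame-wise cluster labels in Hz
smoothed = median_smooth(discrete, width=3)   # isolated jumps are removed

print(centres)
print(smoothed)
```

In this toy track the two isolated outlier frames are pulled back to the pitch level of their neighbours, which is the kind of temporal continuity the abstract relies on; a real system would also check each candidate against the cochleagram before accepting it.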
Related resources
Speech segmentation in synthesized speech morphing using pitch shifting
This paper discusses the speech morphing process showing some limitations of using the directly obtained LPC and excitation parameters of speech. The algorithm here depends on changing the pitch of the source to match that of the target based on analyzing the speech signals to its basic components. Different experiments for changing the female to female, male to male, male to female and female ...
Pitch-based monaural segregation of reverberant speech.
In everyday listening, both background noise and reverberation degrade the speech signal. Psychoacoustic evidence suggests that human speech perception under reverberant conditions relies mostly on monaural processing. While speech segregation based on periodicity has achieved considerable progress in handling additive noise, little research in monaural segregation has been devoted to reverbera...
Monaural Speech Segregation Based on Pitch
The goal of the proposed algorithm is to separate speech signals in monaural recordings even in very adverse conditions when significant background noise and additional speakers are present at the same time. Particularly we try to decide for each time frequency region which of the different sound sources dominates and then build for each sound source a binary mask which is one at t...
Efficient Method of Pitch Estimation for Speech Signal Using MATLAB
In this paper, we estimate the pitch of a telephone speech signal. We use different types of methods, such as Burg, Covariance, Fast Fourier Transform, Modified Covariance, Multiple Signal Classification (MUSIC) algorithm or eigenvector, Multitaper Method (MTM), Welch, and Yule Auto Regressive (Yule AR), to estimate the PSD using the Signal Processing Toolbox of MATLAB. The spectrum was con...
Journal
Journal title: Pakistan Journal of Science
Year: 2023
ISSN: 0030-9877, 2411-0930
DOI: https://doi.org/10.57041/pjs.v67i4.605